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(57) Abstract 

A Web based system provides informational service components (80B-80E) for assisting customers in selecting products or other 
types of items from an electronic catalog of a merchant Users of the system can create and join user communities such as communities 
based cm user hobbies, localities, professions, and organizations. The system also supports implicit membership communities that are based 
on email addresses (e.g., all users having a "nasa.com" email address), shipping/billing addresses, and other known user information. Using 
purchase history data collected for online users, the system automatically identities and generates lists (86A) of the most popular items 
(and/or items that are becoming popular) within particular communities and makes such information available to users for viewing. 
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SYSTEMS AND METHODS FOR ASSISTING USERS IN SELECTING 
AND EVALUATING ITEMS FROM AN ELECTRONIC CATALOG 



PRIORITY CLAIM 

5 This application claims the benefit of U.S. Provisional Application No. 60/126,557, filed April 9, 1999. 

FIELD OF THE INVENTION 

This invention relates to electronic commerce and information filtering. More specifically, this invention relates 
to information processing methods for assisting online users in identifying and evaluating Herns from a database of items 
based on user purchase histories or other historical data. 
10 BACKGROUND OF THE INVENTION 

Web sites of online merchants commonly provide various types of informational services for assisting users in 
evaluating the merchants' product offerings. Such services can be invaluable to an online customer, particularly if the 
customer does not have the opportunity to physically inspect the merchants' products or talk to a salesperson. 

One type of service involves recommending products to users based on personal preference information. Such 
1 5 preference information may be specified by the user explicitly, such as by fitting out an online form, or impficruy, such as by 
purchasing or rating products. The personaized product recommendations may be communicated to the customer via an 
email message, a dynamically-generated Web page, or some other communications method. 

Two types of algorithmic methods are commonly used to generate the personalized recommendations — 
collaborative f Btering and content-based filtering. Collaborative ffftering methods operate by identifying other users with 
20 similar tastes, and then recommending products that were purchased or highly rated by such similar users. Content-based 
filtering methods operate by processing product-related content, such as product descriptions stored m a database, to 
identify products similar to those purchased or highly rated by the user. Both types of methods can be combined within a 
smgte system. 

Web sites also commonly implement services for collecting and posting subjective and objective information 
25 about the product tastes of the onfins community. For example, the Web she of Amazon.com, the assignee of the present 
application, provides a service for allowing users to submit ratings (on a scale of 1-5) and textual reviews of individual 
book, music and video titles. When a user selects a title for viewing, the user is presented with a product detail page that 
includes the title's average rating and samples of the submitted reviews. Users of the site can also access Bsts of the 
bestseting titles within particular product categories, such as "mystery tittes" or "jazz CDs." 
30 SUMMARY OF THE INVENTION 

One problem with the above-described methods is that they fail to take into consideration the level of acceptance 
tin merchant's products have attained within specific user communities. As a result, products that are very popular within 
the communities to which the user belongs or. is affffiated may never be called to the user's attention. For example, a 
programming book that has attained disparate popularity among Microsoft Corporation programmers may never be called 
35 to the attention of other programmers, including other programmers at Microsoft Even where such products are known to 
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the user, the user's ignorance of a product's level of acceptance within specific communities, and/or the user's inability to 
communicate with users who are familiar with the product can contribute to a poor purchase decision. 

The present invention addresses these and other problems by providing various computer-implemented services 
for assisting users in identifying and evaluating items that have gained acceptance within particular user communities. The 
5 services are preferably implemented as part of a Web site system, but may alternatively be implemented as part of an 
ortfine services network, interactive television system, or other type of information system. In one embodiment, the 
services are provided on the Web she of an online store to assist users in identifying and evaluating products, such as book 
tides. 

The communities may include explicit membership communities that users can join through a sign-up page. The 

1 0 explicit membership communities may include, lor example, specific imvershies, outdoors clubs, community groups, and 
professions. Users may also have the option of adding explicit membership communities to the system, including 
communities that are private (not exposed to the general user population). The communities may additionally or 
alternatively include implicit membership communities for which membership is determined without any active participation 
by users. Examples of implicit membership communities include domain-based communities such as Mkrosoftxom Users 

15 (determined from users' email addresses), geographic region based communities such as Haw Orleans Ana Residents 
(determined from users' shipping addresses), and communities for which membership is based on users' purchase histories. 

In accordance with one aspect of the invention, a service is provided for automatically generating and displaying 
community-based popular hems lists. The popular herns fists are preferably in the form of bestseller lists that are based on 
sates activities over a certain period of time, such as the last two months. By viewing these fists, users can readily identify 

20 the bestseffing products within specific communities. In one embodiment, the bestseller fists for the communities of which 
the user is a member are automatically displayed on a personalized Web page. The bestseller lists could also be 
communicated by email, fax, or another communications method. 

One feature of the invention involves generating bestseller lists that are based solely on Internet domains, 
without requiring any active user participation. These domain-based bestseller fists may be displayed automatically on the 

25 home page or other area of the Web she. 

Another feature of the invention involves generating and (fisplaying bestseller fists for "composite communities," 
which are communities formed from multiple implicit and/or explicit membership communities. Using .this feature, a user 
can, for example, view a bestseller list for the composite community AH US. Bicycle Clubs, or Domains of all Software 
Companies. In one embodiment, users can define their own, personal composite communities (such as by selecting from a 

30 fist of noiwomposite communities) to create custom bestseller fists. 

In accordance with another aspect of the invention, a service is provided for notifying users interested in 
particular products of other users that have purchased the same or seriar products, m one embodiment the service is 
implemented by providing user contact information on product detail pages. For example, when a user raws a product 
detail page for a particular product (such as a kayak), the detail page may be customized to include the names and ema9 

35 addresses of other members of the user's community (such as a kayaking club) that recently purchased the same product 



-2- 



WO 00/62223 



PCTAJSOO/40077 



If any of these other members is online, the user may be presented the option to send an instant message or otherwise 
chat online with such members. In one implementation, users can opt to expose their contact information to other 
community members (and thus participate in the service) on a comrnunity-by-conurainity basts. A variation of this service 
involves notifying users interested in particular merchants (e.g., sellers on an online auction site) of the contact information 
of other users (preferably fellow community members) that have engaged in business with such merchants. 

In accordance with yet another aspect of the invention, a notification service is provided for informing users of 
popular products within their respective communities. The popular products may be identified, for example, based on the 
popularity of tha product within the community relative to the product's popularity within the general user population, or 
based simpry on the number of units recently purchased within the community relative to the number of community 
members. In one embodiment, users can also request to be notified of aU purchases made within their respective 
communities. The popular product and purchase event notifications are preferably sent by email (to community members 
that have not yet purchased the product), but may alternatively be communicated using a personalized Web page of other 
method. The notifications may include information for assisting users in evaluating the products, such as the number of 
community members that have purchased the product and/or contact information of such other users. 

In accordance with another aspect of the invention, the purchase histories of users are processed to identify the 
"characterizing purchases' of a community, and these characterizing purchases are used to recommend items within that 
community. Specifically, the purchase history data of the community is compared to the purchase history data of a general 
user population to identify a set of items purchased within the community that distinguish the community from the general 
user population. Items are then implicitly or explicitly recommended to members of the community from this sel, such as 
through popular items lists or email notifications. 

The various features of the invention can also be used in the context of a system in which users merely view, 
download, and/or rate items without making purchases. In such systems, each viewing, downloading and/or rating event 
(or those that satisfy certain criteria) can be treated the same as a purchase event. 

RBIFF DESCRIPT inM OF THE DRAWINGS 

A set of services which implement the various features of the invention will now be described with reference to 
the drawings of a preferred embodiment, in which: 

Figure 1 illustrates an example sign-up page for specifying community memberships and service preferences; 

Figure 2 illustrates a personaEzed community bestsellers page; 

figure 3 iBustrates an example product (book) detad page which includes contact information of other community 
members that have purchased the product; 

Figure 4 illustrates an example hotseller notification email message; 

Figure 5 is an architectural drawing which illustrates a set of components which may be used to implement the 
community bestseller lists, hotseller notification, and contact information exchange services; 

Figure 6 illustrates an offline process for generating the community bestseller lists table and the product-to- 
member tables of Figure 5; 
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Figure 7A and 7B illustrate an online (real time) process for generating personalized community bestseller pages 
of the type shown in Figure 2. 

figure 8 illustrates an online process for generating personalized product detaB pages of the type shown in Figure 

3. 

5 Figure 9 fllustrates an offline process for generating email notifications of hotselling products as in Figure 4. 

Figure 1 0 illustrates a process for notifying community members of purchases made within the community. 

DETAILED DESCRIPTION OF A PREFERRED EMBODIMENT 
A set of online services referred to herein as "Community Interests' will now be described in (total The services 
will initially be described with reference to example screen displays which illustrate the services from the perspective of 
1 0 end users. A set of example data structures and executable components that may be used to implement the services will 
then be described with reference to architectural end flow diagrams. 

The illustrated screen displays, data structures and processing methods used to implement the disclosed 
functions are largely a matter of desip choice, and can be varied significantly without departing from the scope of the 
invention. In addition, although multiple different services will be described as part of a single system, it will be recognized 
1 5 that any one of these services could be implemented without the others. Accordingly, the scope of the invention is defined 
only by the appended claims. 

To facffitate an understanding of one practical appication, the Community Interests services will be described 
primarily in the context of a hypothetical system for assisting users of a merchant Web site, such as the Web site of 
Amazonxom, in locating and evaluating book titles within an electronic catalog. It wiB be recognized, however, that the 
20 services and their various features are also appicable to the marketing and sales of other types of hems. For example, m 
other embodiments, the items that are the subject of the services could be cars sold by an onu'ne car dealer, movies titles 
rented by an online video store, computer programs or informational content electronically dowrtfoarted to users' 
computers, or slock and mutual fund shares sold to onJne investors. Further, it should be understood that the "purchases" 
referred to herein need not involve an actual transfer of ownerslup, but could rather involve leases, licenses, rentals, 
25 subscriptions and other types of business transactions. 

As with the Amazon.com Web site, it wBI be assumed that the hypothetical Web she provides various services 
for allowing users to browse, search and make purchases from a catalog of several miffion book, music and video titles. It 
is also assumed mat information about existing customers of the site is stored in a user database, and that this information 
typically include! the names, shipping addresses, email addresses, payment information and purchase histories of the 
30 customers. The information that is stored for a given customer is referred to collectively as the customer's "user profBe." 

The Community Interests services operate generally by tracking purchases of books within particular user 
communities, and usmg this information to assist potential customers in locating and evaluating book titles. The services 
can also be used with other types of products. The wirmuratas preferably include both "expBch membership 
^immunities" that users actively join, end "implicit membership communities" that are computer or otherwise identified 
35 from information known about the user (e.g., stored in the user database). Examples of irnplicit membership communities 
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include domain-based communities such as Micwsoft.com Users and geographic region base communities such as New 

Orleans Area Residents; memberships to these two types of communities may be determined from user email addresses 

and shipping addresses, respectively. 

The system may also use implicit membership communities (or which membership is based in-whole or in-part on 
5 the purchase activities of the users. For example, the implicit membership community "fishermen" may include afl users 

that have purchased a book about fishing. Where purchase histories are used, the communities may be derated or inferred 

from such purchase histories using clustering techniques. 

In other embodiments, the various features of the invention may be implemented using only one of these two 

types of communities {explicit membership versus implicit membership). In addition, the services may be implemented using 
10 "hybrid* communities that are based on information known about the user but that are actively joined; for example, the 

user could be notified that a community exists which corresponds to his emafl domain or purchase history and then given 

the option to join. 

The Community Interests system includes four different types of services. The first, referred to herein as 
"Community Bestsellers," involves generating and displaying lists of the bestsefl'mg titles within specific communities. 

15 Using this feature, users can identify the book titles that are currently the most popular within their own communities 
and/or other communities. The bestseBing titles are preferably identified based on the numbers of units sold, but could 
additionally or alternatively be based on other sales related criteria. In other embodiments, the lists may be based in-whole 
or in-part on other types of data, such as user viewing activities or user submissions of reviews and ratings. 

One preferred method that may be used to identify bestsetling or popular titles involves monhoring the "velocity" 

20 of each product (the rate at which the product moves up a bestsellers list) or the "acceleration" of each product (the rate 
at which the velocity is changing, or at which sales of the product are increasing over time). This method tends to surface 
products that are becoming popular. To identify the popular hems within a particular community, the velocity or 
acceleration of each product purchased within that community can be compared to the product's velocity or acceleration 
within the general user population. Velocity and acceleration may be used both to generate bestseller fists and to identify 

25 "hot" products to proactively recommend to users (as discussed below). 

The second service, referred to herein as "Contact Information Exchange," involves informing a user that is 
viewing a particular product of other users within the same community that have purchased the same or a similar product 
For example, when a user within Netscapejcom Users views a product detail page for a particular book on programming, 
the page may include the names and email addresses of other Netscape.com users that have recently purchased the title, 

30 and/or an instant messaging box for sending a message to any such user that b currently onine. To protect the privacy of 
the recent purchasers, their names and/or email addresses may bo masked, in which case an email aSas or a bulletin board 
may be provided for communicating anonymously. This feature may also be used to display the contact information of 
other users that have bought from or otherwise conducted business with a particular seller. 

The third service, referred to as "Hotseller Notification,* automatically notifies users of titles that have become 

35 unusually popular within their respective communities. For example, a user within a particular hiking dub might be notified 
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that several other users within his dub have recently purchased a new book on local hiking trails. In one embodiment, a 
community's "hotsellers" are identified by comparing, for each title on the community's bestseller Bst. the tide's popularity 
within the community to the title's popularity within the general user population. The popularities of the titles are 
preferably based at least in-part on numbers of units sold . but may be additionally or alternatively be based other types 
of criteria such as user viewing activities or user submissions of reviews and ratings. 

One such method that may be used to identify the hotsellers (or for generating communhy recommendations in 
general) involves applying an algorithm referred to as the censored chi-square recommendation algorithm to the purchase or 
other history data of users. The effect of the censored chi-square recommendation algorithm (when applied to purchase 
history data) is to identify a set of "characterizing purchases" for the community, or e set of items purchased whhm the 
community which distinguishes the community from a general user population |e.g., all customers). The results of the 
algorithm may be presented to users in any appropriate form, such as a community popular Kerns fist, a notification email, 
or a set of personal recommendations. The censored chi-square algorithm b described in the attached appendix, which 
forms part of the disclosure of the specification. Another such method that may be used to identify the community 
hotsellers involves comparing each title's velocity or acceleration within the community to the thles's velocity or 
acceleration within the general user population. 

A fourth service, referred to as "Purchase Notification." automatically notifies users of purchases Gnduding titles 
and the contact information of the purchaser) made within their respective cwnrnunities. This service may, for example, be 
made available as an option where the community members have aD agreed to share their purchase information. 
Alternatively, users may have the option to expose their purchases to other community members on a user-by-user and/or 



,.jm basis. 

Figure 1 illustrates the general form of a sign-up page that can be used to enroll with the Cmrarnmfty brterests 
services. Although some form of enrollment is preferred, rt will be recognized that Community Bestsellers. Hotsefler 
Notification. Contact Information Exchange and Purchase Notification services can be implemented without requiring any 
active participation by the site's users. For example, all four services could be based solely on the Internet domains of the 
users, without requiring users to actively join communities. In addition, the communities could be defined automatically 
based on correlations between purchases; for example, all users that purchased more than X books within the "Business 
and Investing" category could automatically be assigned to a Business and lm*sting cmraramity. 

As illustrated by Figure 1. the sign-up page includes drop-down Bsts 30 for allowing the user to specify 
membership in one or more explicit membership communities. The communities that are presented to the user are those 
that are currently defined within the system. As described below, new communities may be added by system 
administrators, regular users, or both. In some cases, the ta^1*Xm***«*l**Amvm«BitaHA 
. on information known about the particular user. For example, the selections presented in the "local communiry groups" 
and "local outdoors clubs" lists may be generated based on the user's shipping address. 

Any of a variety of other interface methods could be used to collect comimmity rnembership irrrarrnation from 
users. For example, rather that having the user select from a drop-down list, the user could be prompted to type-in the 
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names of the communities to which the. user belongs. When a typed-in name does not match any of the names within the 
system, the user may be presented with a Est of "dose matches" from which to choose. Users may also be provided the 
option of viewing the membership lists of the communities and specifying the users with which to share information. 

As illustrated by the Gnk 32 and associated text in Figure 1, users may also be given the opportunity to add new 
communities to the system. In the illustrated embodiment, a user wishing to add a new community has die option of 
designating the community as "private," meaning that the community's existence and/or data will not db exposed to the 
general public. Private communities may be useful, for example, whan a closed group of users wishes to privately share 
information about its purchases. Upon creating a private community, the user may, for example, be prompted to enter the 
email addresses of prospective members, in which case the system may automatically send notification emails to such 
users. Through a sbnilar process, companies and organizations may be provided the option of designating their domain- 
based communities as private. 

The sign up page also includes check boxes 36-38 for allowing users to participate in the Contact Information 
Exchange. HotseUer Notification, and Purchase Notification services, respectively. In each case, the user may select a 
corresponding Gnk 40-42 to an associated form page (not shown) to limit participation to specific communities and/or 
product categories. Each user may abo be given the option to expose his or her purchases and/or contact information to 
others on a user-by-user basis. 

When the user selects the submit button 46, the user may be asked certain questions that pertain to the selected 
communities, such as university graduation dates and majors. The user may also be prompted to enter authentication 
information that is specific to one or more of the selected communities. For example, the user may be asked to enter a 
romrmmity password (even if the community is not private), or may be asked a question that all members of the group are 
able answer. A community may also have a designated "group administrator 0 that has the authority to remove 
unauthorized and disruptive users from the group. 

The user's community selections, community data, and service preferences are recorded within the user's profie. 
Also stored within the user's profile are any domain-based or other implicit membership communities of which the user b a 
member. The user's community membership profile may also be recorded within a cookie on the user's machine; this 
reduces the need to access the user database on requests for Web pages that are dependent on this membership profie. 
One method which may be used to store such information within cookies is described in U.S. provisional appL no. 
60/1 1 8,266, the disclosure of which b hereby incorporated by reference. 

Figure 2 illustrates the general form of a personalized Web page (referred to herein as the "community bestsellers 
page") which may be used to display the community bestseller lists. This page may be eccessed, for example, by selecting 
a fink from the site's home page. Cornrnun'rry bestseller lists couM additionally or alternatively be provided on other areas 
of the site. For example, the bestseller list of the Nasa.com domain could automatically be displayed on the home page for 
any user that has purchased a book on space exploration; or. when a user from the domain nttkfnsey.com makes a 
purchase, the user might be presented the message "would you Bke to see the bestsellers from the McKinsy & Co. group?" 
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In the Figure 2 example, it is assumed that the.user is a member of the explicit membership community Cascade 
Bicycle Club and the implicit membership community Mkrosoft.com Users. For each of these corranunraes (as weD as any 
other communities of which the user is a member), the page includes a hypertextual fisting of top selling book titles. The 
methods used to generate these lists are described below. Users may also be given the option (not shown) to view all trtfes 
purchased within their respective communities. 

As depicted by the drop-down tst 50 in figure 2, the user may also be provided the option of viewing the 
bestseller lists of other communities, including cwnmunities of which the user is not a member. As in this example, the 
listing of other communities may be ordered according to the known or predicted interests of the user. A community 
directory structure or search engine may also be provided for assisting users m finding communities and their bestseller 
fists. 

As further illustrated by figure 2, some of the communities may be "composite" communities that are formed as 
the union of other, smaller communities. In this example, the composite communities are AB U.S. Bicycle Clubs, which 
consists of aD regional and other bicycle club communities in the U.S., and Domains of All Software Companies, which 
consists of domains-based communities of selected software companies. Other examples include At law Students and AB 
Physicians. Bestseller lists for composite communities are particularty hetpful for identifying hook titles that are popular 
across a relatively large geographic region. For example, a user searching for a book on biking the United States, or on 
biking in general, would more likely find a suitable book in the AB U.S. Bicycle Clubs bestseUer Est than in the Cascade 
Bicycle Club bestseller list. 

In the preferred embodiment a user can be a member of a composite corranunity only through membership in one 
of that composite community's member, base communities. (A "base community." as used herein, b any non-crmurosite 
corranunity. regardless of whether it is part of a composite communityj The composite communities that are exposed to 
the general user population could be defined by system administrators; alternatively, the composite communities could be 
defined automatically, such as by grouping together an base commimrdes that have certain keywords in their titles. 

In one hnplementatjon, users can also define thai own, "personal* composite comniunifes. such as by selecting 

i from a Est (not shown) of base communities and assigning a community name. Using mis feature, a user could, for 
example, define a composite community which consists of all kayaking date on the West Coast or of a selected group of 
hi-tech companies. If the user has defined a personal composite community, that comrmrrmYs bestseller fist is preferably 
automatically displayed on the user's community bestsellers page IRgure 2). As with the user's community membership 
prof Be, the definitions of any personal composite communities specified by the user may be stored within a cookie on the 

1 user's machine. 

As further illustrated by figure 2. users can also view a bestseller fist of the general user population (e.g, an 
Amazon.com users). The general user population b treated es special type of community (Le., h b neither a base 
corranunity nor a composite community), and is referred to herein as tha "global ccrnmunity." 
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Another option (not illustrated) involves allowing users to specify subsets of larger communities using 
demographic filtering. For example, a user within the MIT community might be given the option to view the bestseffing 
titles among MIT alumnus who fall within a particular age group or graduated a particular year. 

Figure 3 depicts an example product (book) detail page which illustrates one possible form of the Contact 
5 Information Exchange service. Detail pages of the type shown 'm figure 3 can be located using any of a variety of 
navigation methods, including performing a book search using the site's search engine or navigating a subject-based browse 
tree. The contact information 58 of other community members that purchased the displayed book title (preferably within a 
certain period of lime), or possibly similar titles, is displayed at the bottom of the page. In other embodiments, the contact 
information may be displayed without regard to community membership. 
10 In the illustrated embodiment, the contact information 5B includes the name, ema3 address and common 

communities of the users, although telephone numbers, residence addresses, chat boxes and other types of contact 
information could additionally or alternatively be included. In the example shown in Figure 3, the user viewing the book 
detail page might contact such other users to ask their opinions about the book, or about the bike tours described therein. 
In addition, the contact information might be useful for arranging a group trip. As depicted in Figure 3, the page may also 
1 5 include a fink 60 or other type of object for sending an email or other message to the feflow conmunity member. 

In one embodiment (not illustrated), once the relevant set of "prior purchasers" has been identified, the system 
uses weB known methods to determine whether any of these other users is currenry online. If one or more of the prior 
purchasers is online, the user is presented an option to send an instant message to prior purchaser^), and/or to set up a 
private chat room for communicating with prior purchasers. Thus, the contact information may simply be in the form of an 
20 instant messaging box or other option for chatting online with specific users. 

In other embodiments, the various contact information exchange features may be used to assist users in 
evaluating the reputation of a particular merchant For example, when a user views an auction of a particular seller, the 
contact information of other community members (or possibly non-community members) that bought from that seler may 
be displayed, or an option could be provided to chat with any such users that are currently online. Where the merchant has 
25 hs own Web site, the contact information could, for example, be displayed as Web site metadata using a browser add-on of 
the type provided by Alexa internet of San Francisco, California. 

Any of a variety of methods could be used for allowing the prospective purchaser to corranunicate with the fisted 
contacts anonymously. For example, as indicated above, the emaB addresses of the contacts could be special aliases 
created for comrnurricating anonymously On which case the prospective purchaser may simBariy be assigned an email aDas 
30 for the contacts to respond), or the prospective purchaser and the contacts could be given a fink to a private bulletin board 
page. 

Figure 4 illustrates an example of an email document which may ba used to notify community members of a 
hotseffing book title. S'milar notifications may be provided to users through customized Web pages and other 
communications methods. As described below, the email document is preferably sent to afl participating members of the 
35 amvnunity that have not already purchased the book. 
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In the illustrated example, the email document includes a textual description 66 which, among other things, 
includes a synapsis of the book title and informs the user of the level of acceptance the title has attained within the 
community. The description also includes a hypertextusl link 68 to the title's detaS page on the site. In addition, if the 
recipient user participates in the Contact Information Exchange program, the email document preferably includes a listing 

5 70 of the contact information of other community members that have purchased the book. 

Emafl notifications sent by the Purchase Notification service (not shown) may likewise include a synopsis of the 
purchased product and a link to the product's detail page. In addition, where the purchaser has elected to participate in the 
Contact Information Exchange program, the email document may include the purchaser's contact information (and possibly 
the contact information of other community members who have purchased the product); for example, when User A in 

1 0 Community A purchases an item, an email may be sent to other members of Community A with a description of the product 
and User A's contact information. 

Having described representative screen displays of the Community Interests services, a set of Web site 
components that may be used to implement the services will now be described in detail. 

Figure 5 illustrates a set of Web site system components that may be used to implement the above-described 

15 features. Trie Web site system includes a Web server 76 which accesses a database 78 of HTML (Hypertext Markup 
Language) and related content. The HTML database 78 contains, among other things, the basic HTML documents used to 
generate the personalized sign-up, community bestsellers, and product detd pages of Figures 1-3. The Web server 76 
accesses service code 80, which in-turn accesses a user database 82, a community database 84, a bibliographic database 
of product data (not shown), and a database or other repository of community data 86. The various databases are shown 

20 separately in Figure 1 for purposes of llustration, but may in practice be combined within one or more larger database 
systems. The service code 80 and other executable components may, for example, run on one or mora Unix or Windows 
NT based servers and/or workstations. 

The community data 86 includes a "community bestseller fists* table 86A which contains, for the global 
community and each base community, a listing of the currently bestselEng book titles. In some implementations, the listing 

25 for the global community is omitted. In the illustrated embodiment, each entry 88 in each bestseller fist includes: (a) the 
product ID (ProdID) of a book tide, and (b) a count value which represents, for a given time window, the number of copies 
purchased by members of the community. The product IDs may be assigned or processed such that different media 
formats (efl, paperback, hardcover, and audio tape) of the same true are treated as the same hem. As described below, 
the community bestseller fists table 86A is used both for the generation of bestsetfer fists and the generation of hotsefter 

30 notifications. 

The community data 86 also includes, for each base community, 8 respective product-to-inernber mapping table 
866 which maps products to the community members that have recently purchased such products (e*, within the last 2 
months). For example, the entry for product Prod_A within the table 86A for Community A is in the form of a fating of the 
user IDs and/or contact information of members of Cwrm unity A that have recently purchased that product In the 
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preferred embodiment, only those community members that have opted to participate in the Contact Information Exchange 
service are included in the fists. 

As mentioned above, the user database 82 contains information about known users of the Web site system. The 
primary data hems that are used to implement the Community Interests service, and which are therefore shown in figure 
5, are the users' purchase histories, community memberships, service preference data (e*. whether or not the user 
participates in the Contact Information Exchange and Hotseller Notification services), and shipping information. Each 
user's purchase history is in the general form of a list of product IDs of purchased product together with related 
information such as the purchase date of each product and whether or not the purchase was a designated by the user as a 
-gift." Purchases designated as gifts may be ignored for purposes of evaluating community interests. Each user's 
database record also preferably includes a specification of any personal composite communities the user has defined, for 
viewing customized bestseller lists. 

With further reference to figure 5, the community database 84 contains inf ormation about each base community 
(including both explicit and implicit membership base communities when both types are provided) that exists within the 
system. This information may include, for example, the community name, the type of the community (e.g.. 
college/university, local community group, etc.), the location (city, state, country, etc) of the community, whether the 
community is private, whether the community participates in the Purchase Notification service, any authentication 
infoimation required to join the commun'ity, and any community policies (e.g., by joining, all users agree to expose their 
purchases to other members). For implicit membership communities, the database 84 may also include information about 
the user database conditions which give rise to membership. As indicated above, the information stored within the 
communities database 84 may be generated by end users, system administrators, or both. 

The community database 84 also includes information about any composite communities that have been defined 
by system admmistrators. For each composite community, this information may include, for example, the community name 
and a list of the corresponding base communities. For example, for the All Bicycle Clubs community, the database would 
contain this name and a list of all existing bicycle club base communities. 

As depicted by figure 5, the community database 84 may also conta'm information about relationships or 
associations between base communities. The information may be specified by system administrators, and may be used to 
identify similar communities for display purposes. For example, when a user of the Mknsoftxom Users cornmunity views 
the community bestsellers page (Figure 2), the associated community Ketsc3pe.c0m Users may automatically be displayed 
at the top of the drop-down Gst 50, or its bestseller Est be displayed on the same page. 

As illustrated by Figure 5, the service code 80 includes five basic processes 80A-80E that are used to implement 
the Community Interests services. (As used herein, the term "process" refers to a computer memory having executable 
coda stored therein which, when executed by a computer processor, performs one or more operations.) Each process is 
illustrated by one or mora flow diagrams, the figure numbers of which are indicated in parenthesis in Figure 5. The first 
process 80A is an ofMine process (meaning that it is not executed in response to a page request) which b used to 
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periodically generate the tables 86A and 86B based on information stored in the user and community databases 82, 84. 
Processes 80B-80D use these tables to perform their respective functions. 

The second process 80B is an online process which is used to generate personalized community bestsellers pages 
of the type shown in Figure 2. The third process 80C is an online process which is used to generate product detail pages 



within notification emails of the type shown in Figure 4. The fourth process 800 is an offline process which is used to 
identify and notify users of hotselling products within specific communities. The fifth process 80E is used to implement 
the Purchase Notification service. 

Figure 6 illustrates the steps performed by the table generation process 80A to generate the tables 86A. 86B. 

1 0 The process may, for example, be executed once per day at an off-peak time. A process which updates the tables in real- 
time in response to purchase events may alternatively be used. In step 100, the process retrieves the purchase histories of 
all users that have purchased products within the last N days (e.g., 60 days). Submissions of ratings or reviews may be 
treated as purchases and thus included in the purchase histories. The variable N specifies the time window to be used both 
for generating bestseller lists and for identifying hotselling hems, and may be selected according to the desired goals of the 

1 5 service. Different time windows could alternatively be used for generating the bestseller fists and for identifying hotselling 
' Hems; and different time windows could be applied to different types of communities. 

In step 102, the retrieved purchase histories are processed to build a list of aft products that were purchased 
within the last N days. Preferably, this list includes any products that were purchased solely by global community 
members, and thus is not limited to base community purchases. 

20 In step 104, the process uses the data structures obtained from steps 100 and 102 to generate a temporary 

purchase count array 104A. Each entry in the array 104A contains a product count value which indicates, for a 
corresponding community: product pair, the number of times the product was purchased by a member of the community in 
the last N days. For example, the array 104A shown in Figure 6 indicates that a total of 350 users purchased product 
"PRODI," and that three of those purchases came from base community "BASE_1." A pseudocode listing of a routine 

25 that can be used to generate the array is shown in Table 1 . Multiple purchases of the same product by the same user are 
preferably counted as a singe purchase when generating the array. 



5 with contact information as shown in Figure 3; and which may also be used to cor 



i to be displayed 




For each user; 



For each product purchased by user in last N days; 

For each community of which user is a member; 

increment purchase_count(community, product) 



TABLE 1 
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In step 106, the data stored in the array is used to generate the community bestseller lists. This task involves, 
for each base community and the global community, forming a fist of the purchased products, sorting the fist according to 
purchase counts, and then truncating the list to retain only the X |e.g., 100) top selling titles. A longer bestsellers list (e.g, 
the top selling 10,000 titles) may be generated for the global community, as is desirable for identifying community 
5 hotselters. 

As indicated by the parenthetical in block 106, product velocity and/or acceleration may be incorporated into the 
process. The velocity and acceleration values may be calculated, for example, by comparing purchase-count-ordered fists 
generated from the temporary table 104A to (ike fists generated over prior time windows. For example, a product's 
velocity and acceleration could be computed by comparing the product's position within a current purchase-count-ordered 

1 0 list to the position within like lists generated over the last 3 days. The velocity and acceleration values can be used, along 
with other criteria such as the purchase counts, to score and select the products to be included in the bestseller fists. 

The bestseller lists are written to a table 86A of the type depicted in figure 5, and the new table replaces any 
existing table. The bestseBers fists of base communities that have less than a pre-specified threshold of total sales (e.g, 
less than 5) may optionally be omitted from the table 86A. Bestseller fists for the composite communities defined by 

15 system administrators could also be generated as part of the Figure 6 process, or could be generated "on-the-fly" as 
described below. 

The last two steps 108. 110 of Figure 6 are used to generate the product-to-member mapping tables 868 of 
Figure 5. The first step 108 of this process involves generating a temporary table (not shown) which maps base 
communities to corresponding members that have opted to participate in the Contact Information Exchange program 
20 ("participating members"). In step 1 1 0, this temporary table and the purchase histories of the participating members are 
used to generate the product-to-member mapping table 86B for each base community. The contact information of the 
participating members may also be stored in these tables 86B to reduce accesses to the user database 82. Although a 
separate table 86B is preferably generated for each base community, a single table or other data structure could be used. 
Any of a variety of other types of user activity data could be monitored and incorporated into the Figure 6 
25 process as a further indication of product popularity. Such data may include, for example, "cfick-through' events to 
product detail pages, "add to shopping cart" events, and product ratings and reviews submitted by users. 

Figures 7A and 7B illustrate the steps that are performed by the community bestseller processing code 80B to 
generate personafized community bestseller pages of Ihe type shown in Figure 2. The first step 120 in Figure 7A involves 
generating a fist of the communities for which bestseller lists are to be generated end displayed. If the user has already 
30 selected one or more communities from the drop down box 50 (Figure 2), these selected communities are included hi this 
fist. H the user's identity is known, the user's base immunities and personal composite communities, if any, may be 
added to this fist H the fist is empty at this point, a set of default communhies may use! User rdentfties are preferably 
determined using browser cookies, although a login procedure or other authentication method could be used. In other 
implementations, the community bestseDer lists may be displayed without regard to the user's community membership 
35 profile. 
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The next step 124 involves generating the bestseller lists for each of the selected communities. This process is 
illustrated by Figure 7B and is described below. In step 126, the process identifies any communities that are related to the 
user's base communities, so that these related communities can be displayed within or at the top of the drop down list 50 
(Figure 2). Any composite community which includes one of the user's base coirnnunhies may automatically be included in 
this Ost In addition, information stored in the community database 84 may be used to identify related base communities. 
In other implementations, this step 124 may be omitted. Finally, in step 128, the bestseller fists and the fist of related 
communities are incorporated into the community bestsellers page. 

With reference to Figure 7B, if the community is not a composite community (as rtetemraied in step 134), the 
community's bestseller list is simply retrieved from the table 86A (step 136). Otherwise, the bestseller fists of all of the 
composite conwnuruty's member base communities are retrieved and merged (steps 138-142) to form the bestseller fist As 
part of the merging process, the product count values could optionally be converted to normalized score values (step 138) 
so that those communities with relatively large sales volumes will not override those with smaller sales volumes. For a 
given product within a given bestseller list, the score may be calculated as (product's purchase count)/|total purchase count 
of bestseller Hst). The fists are then merged while summing scores of tike products (step 140), and the resulting 1st is 
15 sorted from highest to lowest score (step 142). If the composite community is one that has been defined by system 
administrators (as opposed to a personal composite community defined by the user), the resulting bestseller fist may be 
added to the table 86A or otherwise cached in memory to avoid the need for regeneration. 

As depicted in step 144, one optional feature involves fBtering out from the bestseller fist some or all of the 
products that exist within the global community's bestseller list. For example, any book tide that is within the top 500 
20 bestseller's of the general population may automatically be removed. Akernativery, such titles could be moved to a lower 
position within the list This feature has the effect of highlighting products for which a disparity exists between the 
product's popularity within the global community versus the community for which the bestseller fist is being generated. 
This feature may be provided as an option that can be selectively enabled or invoked by users. Products could additional 
or alternatively be f Btered out based a comparison of the product's velocity or acceleration within the particular corrunimty 
25 to the product's velocity or acceleration within the global community. 

As illustrated by step 146. the bestsefler fist is truncated (such as by taking the top 10 entries) and then returned 
to the process of Figure 7A for incorporation into the Web page. The Figure 7B process is repeated for each community to 
be included within the community bestsellers page. 

Figure 8 illustrates the steps that ate performed by the product detail page process 80C to generate detail pages 
30 (as in Figure 3) for participants in the Contact Information Exchange program. As indicated above, product detail pages can 
be accessed using any of the she's navigation methods, such as conducting a search for a title. In step 150, a list of the 
base Bmrmuiuties of which the user is a member is obtained - either from a browser cookie or from the user database 82. 
For each base community in this fist, that community's producMoinember mapping table 86B (Figure 5) is accessed to 
identify any other users within the community that have purchased the product In step 154. the contact information for 
35 each such user is read from the table 86B or from the user database 82. In step 158, the eontact information and 
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associated base community names are incorporated into the product's detail page. As indicated above, an option may 
additionally or alternatively be provided for the requester of the page to chat with any such other users that are currently 
online. 

Figure 9 illustrates the off line sequence of steps that are performed by the hotseller notifications process 80D. 

5 The general purpose of this process is to identify, within each base community, any "hotsefling" products {based on pre- 
specifted criteria), and to call such products to the attention of those within the community that have not yet purchased 
the products. The sequence 160-168 is performed once for each base community. In other implementations, the process 
could also be used to identify hotsellers in composite communities. 

In step 160, the process sequences through the products in the community's bestseller list while applying the 

1 0 hotseller criteria to each product. If multiple products quaGfy as hotsellers, only the "best" product is preferably selected. 
In one embodiment, a product is flagged as a hotseller if more than some threshold percentage (e.g., 5 %( of the 
community's members have recently purchased the product, as determined from the data within the community bestseller 
fists table 8 6 A. This threshold could be a variable which depends upon the number of members of the community. 

In another embodiment, the position of the product within the community's bestseller list is compared to the 

1 5 product's position, if any, within the global community's bestseller list For example, any title that is in one of the top ten 
positions within the community's list but which does not appear in the top 1000 bestsellers of the general population may 
automatically be flagged as a hotseller. In addition, as mentioned above, hotsellers may be identified by comparing the 
product's velocity or acceleration within the community to the product's velocity or acceleration within the global 
community. In addition, the censored chi square algorithm described in the attached appendix may be used to identify the 

20 hotsePers. In other implementations, these and other types of conditions or methods may be combined. 

If no hotseller is found for the community (step 162), the process proceeds to the next base community (step 
170), or terminates if all base communities have been processed. If a product is found, the product-tc-member mapping 
table 86B (Figure 5) is accessed to identify and obtain the contact information of any participating members that have 
purchased the product (step 164). In step 166, the process generates an email document or other notification message. 

25 As in Figure 4, this message preferably includes the contact information and a description of the product. In other 
implementations, the notifications may be communicated by facsimile, a customized Web page, or another communications 
method. 

In step 168, the notification message is sent by ema3 to each base community member who both (1) has not 
purchased the product and (2) has subscribed to the email notification service. Such members may be identified by 
30 conducting a search of the user database 81 The notification messages could alternatively be sent out to afl community 
members without regard to (1) and/or (2) above. For users that have not subscribed to the Contact Information Exchange 
service, the contact information may be omitted from the notification message. 

figure 10 illustrates a sequence of steps that may be performed to implement the Purchase Notification service. 
This process may be implemented whenever a user completes the check-out process to purchase one or more products, in 
35 step 180, the user's profile is checked to identify any base communities in which the user participates in the Purchase 
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Notification service. For each such community, all other participating members are identified in step 182. In step 164, a 
notification message is generated which includes a description of the purchased product(s) and the name of the common 
community. If the user participates in the Contact Information Exchange service, the contact information of the purchaser 
may also be included within this message. In step 188, the notification message is sent by email to aO participating 
5 members identified in step 1 82, Alternatively, purchase notifications that have accumulated over a period of time may be 
displayed when a user logs into the system. 

The various community-related features described above can also be implemented in the context of a network- 
based personal information management system. One such system is implemented through the Web ate of RanetAII 
(www.planetall.com). Using this system, users can join various online communities and can selectively add members of 

1 0 such communities to a virtual, personal address book. In addition, each user can selectively expose his or her own personal 
information to other community members on a user-by-user and datum-by-datum basis. Additional details of this system 
are described in U.S. appL no. 081962,997 titled NETWORKED PERSONAL CONTACT MANAGER filed November 2, 1997, 
the disclosure of which is hereby incorporated by reference. 

fn the context of this and other types of network-based address book systems, the contacts listed within a 

15 user's address book may be treated as a "community" for purposes of implementing the above-described features. For 
example, a user may be given the option to view the products purchased by other users fisted in his or her address book (or 
a particular section of the address book), or to view a bestsellers list for such users. Further, when the user views a 
product detail page (or otherwise selects a product), the contact information of other users within the address book that 
bought the same product may be displayed. Further, a user may be given the option to conduct a search of a friend's 

20 address book to locate another user that purchased a particular product 

Although this invention has been described in terms of certain preferred embodiments and applications, other 
embodiments and applications that are apparent to those of ordinary skJl in the art, including embodiments which do not 
provide all of the features and advantages set forth herein, are also within the scope of this invention. Accordingly, the 
scope of the present invention is intended to be defined only by reference to the appended claims. 
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Appendix 

1. Overview 

5 The censored chi-square recommendation algorithm constructs a set of candidate recommendations for a predefined 
group of customers, it then conducts a statistical hypothesis test to decide whether or not these candidate 
recommendations are really a result of group preferences which differ from the preferences of the overall customer base. 
If the conclusion is that group preferences do differ significantly from overall customer preferences, the recommendations 
are presented to the group. 

10 

The inputs to the censored chi-square algorithm are the purchases made by the group (over some time period) and the 
purchases made by all customers (over the same time period). Other types of events, such as Hern viewing, downloading 
and rating events, can additionally or alternatively be used. 

15 The purchases of the entire customer base are used to formulate expectations about bow many customers in the group 
wffl have purchased each available item, given the total number of purchases by the group. The 'group purchase count" 
for each hem is the number of customers in the group who actually purchased the hem. The candidate recommendations 
are first restricted to be those items whose group purchase counts exceeded expectations. Of these candidates, only 
those hems with the largest group purchase counts are then retained. These final candidates are sorted according to how 

20 much their group purchase counts exceeded expectations (subject to a normalization). The values used to sort the 
candidates are called the "residuals". 

These residuals form the basis of a test statistic which leads to an estimate of the probability that expectations about 
the group are the same as expectations about aD customers. K this probabiSty is low, h is inferred that the group's 
25 preferences are significantly different from the preferences of all customers, and the recommendations are returned as 
output If the probability is high, on the other hand, then little evidence exists to suggest the group's preferences differ 
from overall preferences, so no recommendations are returned. 

2. Algorithm for Constructino Censored Chi-Souare Recommendations 

30 

let A be the set of customers in the purchase circle (community) under consideration. 

With respect to the minimum lookback horizon L such that SJ.99} (defined below) is at least 5: 

Define P-{<ci>:cvnAandc purchased item i at least once between today and L periods ago } 

35 
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Let |P| -n. 

Define I - { i : there exists a c lin A such that < e, i > tin P } 

Define observed counts, expected counts, residuals and standardized residuals as follows: 

0(9- | { c : c V'n A and c purchased i within L } |,i\inl 

e{i) - n * phatj, phatj is the estimated purchase probability for I, i Wn I 

rli)- o(i)-e(i),Hinl 

r_s(D-r(i)/sqrt(e(i)),i\inl 

Define I* Isubset I - { i : i lin I and rfi| > 0 } 

Let S be the image of I* under o(i). Let | S | - d. 

^ sjl), SJ2), ~, SJd) be the order statistics of S. thus SJd) is the number of distinct customers who purchased the 
most-purchased (positive- residual) item. Note ties are common, so that a subsequence SJi). S_(i+1), SJi+j) may 
have all elements equal. 

Let S_{c). 0 < - c < - 1. be the cth quantile of S. that is, (100*c)% of to other elements in S are less than or equal 
toS {c}. Interpolate and break ties as necessary to determine S_{c}. 

Let SR be tlra set of standardized residuals which correspond to elements of S that are > » S_{.99) . 
Let|SR| -m. 

Let SRJ11 ~., SRJm) be the order statistics of SR. 

CaB the desired number of recommendations r. Then the order statistic index of the final recommendation candidate is r* 
- maxfriH+1, 1). 

ComputeT - \sumji-r'} *m SRJT 2 

Compute the p-value of T, i.e. Pr(X > T) where X * cX"2{n, r*). 

If the p-value achieves the desired significance level then the recommended items for the circle, in order, are SRJm), 
SRJm-1), - SRJr* +1). SRJr*). 
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3, Estimating the Samnlina Distributi on nf the Censored Chj-Square Statistic 

To construct a numerical approximation of the censored chi-square sampling distrojution under the null hypothesis, we 
5 employ a statistical resampling technique called the bootstrap. The idea is straightforward. We create a group of 
customers by simple random sampling with replacement from the entire customer base. By construction, the expected 
purchase allocations of such a group follow the probability model of our null hypothesis. We emphasize that this is simply 
an algebraic consequence of the method used to fit the null model and in fact the linearity of expectation guarantees that 
it holds algebraically regardless of any interdependences our model ignored in the joint distribution over purchase 



We then compute the censored chi square statistic for this random group, as presented above. We can think of the value 
so obtained as an approximate sample drawn from the censored chisquare's null distribution. By repeatedly ID 
constructing a set of customers randomly and (2) computing its censored chi square statistic, we approximate the so- 
called empirical distribution of the cX'2 under the null hypothesis. Under mild to moderate probabilistic conditions, the 
empirical distribution converges to the true null distribution of the statistic Thus an approximate 10011 • alpha)* 
significance level test for circle idiosyncrasy can be conducted by comparing the circle's cX*2 statistic value to the 
(alpha)th quantile of the bootstrapped empirical distribution. Also note that, as a sum of (tteoreticalty) independent 
random variables, the cX"2 sampling distribution should converge asymptotically to the normal iSstribution as the number 
of observations over which the statistic is computed grows large. We can determine when application of the normal 
theory is feasible by testing goodness-of-fit of the bootstrapped distribution to the normal, for example using the 
KolmogorovSmtmoff statistic. 

Under the assumptions of the null hypothesis, the value of the cX"2 can be shown to grow linearly in the total purchase 
count of the circlB (community) as well as the number of hems to recommend |i.e. terms in the cX'2 summation). Since 
the purchase probabilities are constants under the null hypothesis, these are the. only two variables with which the cX"2 
grows. So in theory we would want to bootstrap a distribution for each possible <n, r> pair, where n Is the circle's 
purchase count and r the number of recommended items. In practice, both n and r are random variables which depend on 
the particular set of random customers we assemble at each iteration of the bootstrap. So we bootstrap various random 
group sites at various lookback horizons, then recover the sampling distributions from the <n, r> values impneitfy 
obtained in the course of each iteration. We can then construct approximate empirical distnTiirtions for <n, r> intervals 
which are large enough to contain enough observations for us to get useful convergence to the true null distribution. With 
these parameterized approximate sampling distributions available, we conduct a hypothesis test using the senium 
se < n. r > interval contains the values of n and r actually obtained for the circle being tested. 
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IV. Determination of Optimal Lookback Horizon 

Before testing the hypothesis that a particular purchase circle follows the probability model to aDocate its purchases 
across Hems, we decide how much of the circle's available transaction data to use in computing the censored chkquare 
test statistic. We choose to utilize data looking sequentially backwards in time, without weighting observations. Thus the 
question of how much data to use is equivalent for our purposes to asking how many prior days of data to include in the 
computation. We refer to this number of days as the lookback horizon associated with the purchase circle. 

In general, the power of a test statistic (the probability the test statistic will detect deviations from the null hypothesis) is 
a nondecreasing function of the amount of data provided, so using all avaBable data normally won't harm our statistical 
inferences. There are other drawbacks in our situation, however. First, the stationarity assumption behind the purchase 
probabifity estimates is at best only locally , correct. The further back in time we look, the more likely it is that 
nonstationarity in the purchase probabilities will manifest itself in our hypothesis tests. Since this nonstationarity impacts 
the bootstrap as well, it is actually a pervasive problem that can't be circumvented with simple resampling, and it wiD 
tend to cause us to detect circle idiosyncrasies where none actually exist 

Second, without researching the power function of the censored chi-square, we cannot make any statements about the 
expected power benefits of incrementally larger datasets. In fight of this, it makes sense to let computational efficiency 
dictate the sizes of the datasets used in hypothesis testing. In other words, knowing nothing about the relative value of 
larger datasets, we will use the smallest dataset which allows a given purchase circle to satisfy the reasonabiltty 
criterion. Currently this means that the observed count for the 99th percentile of the circle's positive-residual hems, 
ranked by observed count, must be at least 5. 

Determining the minimum lookback horizon consistent with this constraint would in genera! require repeated 
computations at successively longer horizons for a particular circle. Instead, for computational efficiency, we wtO 
forecast a horizon that has high probability of satisfying the constraint, accepting that in expectation some smaD 
percentage of circles will fail to satisfy it The forecast is produced as a side effect of the bootstrap computation (see 
above). Each random group size we bootstrap over will have iterations at many horizons. At each horizon, some fraction 
of the iterations win fail the reasonably criterion. We record all such failures. Roughly speaking, the fraction of failures 
should decrease as lookback horizon increases. Given a purchase circle whose minimum lookback horizon we want to 
forecast we find the bootstrap group size h is close to, then pick the shortest horizon which had an acceptable failure 
rate. If no bootstrapped horizon had an acceptably low rate, we choosB the longest horizon and accept that many 
idiosyncratic circles of that size will escape detection by failing the reasonabifity criterion. 
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WHAT IS CLAIMED IS : 

1. A method of assisting users in selecting hems from an electronic catalog of hems, the catalog 
accessible to users of an online store that provides services for allowing users to purchase hems from the catalog, the 



providing a database which contains information about a plurality of user communities, wherein 
different communities represent different subsets of users of the store; 

tracking online purchases of items from the store by the users to generate purchase history data, and 
storing the purchase history data in a computer memory; 

processing at least the purchase history data to identify at least one item which, based on pre- 
specified criteria, has become popular within a particular community; and 

electronically notifying members of the community that the at least one hem is popular within the 



2. The method of Claim 1, wherein electronically notifying members of the community comprises 
generating a Web page which includes a community-based most popular items list 

3. The method of Claim 2, wherein the most popular hems list is a bestsellers fist 

4. The method of Claim 1. wherein electronically notifying members of the communrty comprises 
automatically generating and sending an email message to members of the communrty. 

5. The method of Claim 4. wherein the email message contains contact information of at least one 
member of the community that has purchased an item described in the email message. 

6. The method of Claim 1, wherein processing the purchase history data to identify at least one hem 
comprises identifying a set of characterizing purchases for the corrrrmirity. 

7. The method of Claim 1, wherein the community is an impBcit membership conrnnmity. 
B. The method of Claim 7, wherein the community is based on emarl addresses of users. 

9. The method of Claim 1 , wherein the communrty is an explicit membership community. 

10. The method of Claim 1 , wherein the community it derived from an electronic 8ddress book of a user. 

11. The method of Claim 1. wherein the community a a composite crjmmunity which comprises mutrJpte 
other communities of the database. 

12. A system for assisting users of an online store in selecting hems from an electronic catalog of items, 
the system comprising: 

at least one database which contains purchase history data for users of the store, and which contains 
information about a plurality of user communities wherein different communities represent different subsete of 
users of the store; and 

a computer process which identifies items that are popular within particular communities of the 
plurality of communities by analyzing at least the purchase history data, and which notifies users of the store of 
the hems that are popular within particular communities. 



WO 00/62223 



PCT/US00/40077 



13. The system of Claim 12. wherein the process comprises a first process which generates a data store 
which contains bestselling hems fists for at least some of the communities, and a second process which selects hems from 
the table to display to users. 

14. The system of Claim 12, further comprising a user interface which allows users to select and join at 
least some of the user communities. 

15. The system of Claim 12, further comprising a user interface which allows a user to define a composite 
community that includes multiple communities of the database, and to initiate generation of a popular hems fist for the 
composite community. 

16. The system of Claim 12, wherein at least some of the communities are implicit membership 
communities. 

17. The system of Claim 12, wherein at least soma of the communities are based on email addresses of 

users. 

18. A method of assisting users in selecting hems from an electronic catalog of hems, the catalog 
accessible to users of an online store that provides services for aBowing users to purchase items from the catalog, the 
method comprising the computer-implemented steps of: 

identifying a subset of users of the store that have email addresses that satisfy a particular criteria; 
identifying at feast one item that is popular among the subset of users, wherein the step of identifying 
comprises processing purchase history data of at least the subset of users; and 

electronically notifying users of the store of a popularity of the at least one hem among the subset of 

users. 

19. The method of Claim 18, wherein identifying 8 subset of users comprises identifying aB users of a 
selected email domain. 

20. The method of Claim 1 9, wherein the selected email domain is an email domain of a selected company. 

21. The method of Claim 18, wherein identifying a subset of users comprises identifying aB users of a 
selected group of email domains. 

22. The method of Claim 18, wherein electronically notifying comprises generating a Web page which 
includes a list of bestselling hems among the subset of users. 

23. The method of Claim 1B, wherein electronically notifying comprises sending emaB notification 
messages to at least some of the users of the subset. 

24. A method of recommending items from a catalog of hems, comprising: 

identifying a community of users that represents a subset of a general population of users; 

tracking 8t least one type of user activity that indicates user affinities for particular hems of the 
catalog to generate history data; 

processing the history data of the general population of users, including the community of users, to 
identifying a set of hems that distinguish the comrmmity from the general population; and 
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recommending items from the set of items to members of the community. 

25. The method of Clann 24, wherein processing the history data comprises processing purchase history 
data, and the set of herns consists essentiaDy of items purchased by members of the community. 

26. The method of Clam 24, wherein tracking at least one type of user activity comprises tracking Hem 
viewing events. 

27. The method of Claim 24, wherein the community is an implicH-membership community. 

28. A method of assisting users in evaluating items of an electronic catalog of items, the catalog 
accessible to users of an online store that provides services for allowing users to purchase hems from the catalog, the 
method comprising the computer-implemented steps of: 

storing contact information for at least some of the users of the store within a computer memory; 
identifying an item to be displayed to a first user; 
identifying a second user that has purchased the Hem; 
retrieving the contact informaticn for the second user from the computer memory; and 
electronically notifying the firs; user of the contact information of the second user to allow the first 
user to communicate with the second user about the item. 

29. The method of Claim 28, wherein identifying the second user comprises: 
identifying a user community o' -which the first user is a member; and 
determining whether another member of the community has purchased the item. 

30. The method of Claim 28, wherein identifying an item to display to the first usBr comprises: 
identifying a community of which the first user is a member; and 

identifying an item that fa popuiar within the corrummity. 

31. The method of Claim 30, wherein identifying an Hem that is popular within the cmrmunity comprises 
identifying at least one hem which, based on purchases mads whhn the community relative to purchases made within a 
general user population, distinguishes the community from the general user population. 

3Z The method of Claim 28, wherein electronically notifying comprises incorporating the contact 
hvf ormation into a product detaD page requested by the first user. 

33. The method of Claim 28, wherein electronically notifying comprises sending to the first user an email 
message which contains the contact information and a description of the product; 

34. The method of Claim 28. further comprising generating a data structure which maps Hems to users 
that purchased the Hems. 

35. The method of Claim 28, wherein the contact information includes an email alias for permitting 
anonymous communications. 

36. The method of Claim 28, furir :r comprising presenting to users of the store at toast one option screen 
which permits users to authorize release of contar.'. information to other users on at least one of fa) a user-by-user basis, 
and (b) a conromity-by-communrty basis. 

-23- 



WO 00/62223 



PCTAJS00/40077 



37. . The method of Claim 28, whetain the commun'rtY is an impficit membership community. 

38. The method of Claim 37, wherein the implicit membership community is based on email addresses of 

users. 

39. The method of Claim 28. wherein electronically notifying the first user of the contact information 
comprises presenting the first user an option to chat onDne with the second user. 

40. A system for assisting users of an online store in evaluating items of an electronic catalog of items, the 



a data structure which maps items from the catalog to users that have purchased such hems; and 
a process which responds to an online request by a first user to view a description of an hem from the 
catalog by at least (a) accessing the dB-a s:-ucture to identify a second user that has purchased tha hem, and (b) 
displaying contact fnformatiDn of the second user to the first user in conjunction with the description. 

41. The system of Claim 40, wherein the data structure maps hems to users that purchased the hems for 
each of a plurality of communities, and the process uses the data structure to locate a fellow community member that 
purchased the hem. 

42. The system of Claim 40, whureiu the data structure contains the contact information of users. 

43. The system of Claim 40, wherein the contact information includes ema3 aliases for at least some of 

the users. 

44. The system of Claim 40, wh-rei > the contact information includes an option for the fust user to chat 
online with the second user. 

45. A method of assisting a first nsnr in evaluating a merchant comprising: 
identifying a community of which the first is a member; 

identifying a second user that is a member of the community and that has engaged in business with the 



electronically notifying the firs' user of the contact information of the second user to allow the first 
user to communicate with the second usw about the merchant 
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To: Erin@Microsoft.com 
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Dear Erin Indianer; 




We thought you might won: to know that the title 

The Jovo Developers Almon 1999 by Patric Chan is currently hot 

within the Microsoft.com community. Listed below are the names 
and e-mail addresses of some members of the community that 
purchased the title 
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Synopsis: 

Arranged for rapid access to enhance programming efficiency, 

this is a powerful Java quick-reference with comprehensive, condense 

coverage of the new final "crsion of JDK 1.2 




Contacts: 1 

Richard Smith (RSmith@microsoft.com) 

John King (JKing9Microsofr.com) \-70 
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(RUN ONCE PER DAY) J 



RETRIEVE PURCHASE HISTORIES 
OF USERS THAT HAVE PURCHASED 
A PRODUCT IN THE LAST N DAYS 



| GENERATE LIST OF PURCHASED PRODUCTS \ 



FOR GLOBAL COMMUNITY ANO E .CH BASE 
COMMUNITY, GENERATE PURCHA E COUNT 
FOR EACH PURCHASED PR' ' UCT 



FOR GLOBAL COMMUNITY AND EACH BASE 

COMMUNITY, GENERATE ORDERED UST 
OF X BESTSELUNG PRODUCTS, (OPTIONALLY 
INCORPORATING PRODUCT VELOCITY 
OR ACCELERATION), AND OUTPUT UST 
AND COUNT VALUES TO BES1.-ELLING 
BESTSELLER UST TABLE (FiG. 5) 



COMMUNITY 


PROD1 PROD2 PROD3»« 


GLOBAL 


350 18 30 


BASE_1 


3 0 2 


BASE_2 


0 1 0 


BASEJS 


9 0 0 







I BUILD TEMPORARY TABLE WHICH APS BASE I 
COMMUNITIES TO PARTICIPATING MEMBERS | 



FOR EACH BASE COMMUNITY, IDENTIFY 
PRODUCTS PURCHASED BY PARTICIPATING 
MEMBERS, AND RECORD USERS IN 
COMMUNITY'S PRODUCT- TO- MEMirERS TABLE 
(FIG. 5) 
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IDENTIFY COMMUNITIES FOR WHICH 
BESTSELLEF LISTS ARE TO BE 
D. SPLAYED 



GENERATE BESTSELLER 
LISTS 
(FIG. 7B) 
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IDENTIFY COMMUNITIES RELATED 
TO USER'S -3ASE COMMUNITIES 



INCORPORATE BESTSELLER LISTS 
AND LIST OF DELATED COMMUNITIES 
INTO COMMUNITY BESTSELLERS PAGE 
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RETRIEVE BESTSELLER LISTS OF 
ALL BASE COMMUNITIES OF 
COMPOSITE COMMUNITY AND 
CONVERT PRODUCT COUNTS 
TO SCORES 



MERGE LISTS WHILE 
SUMMING SCORES OF 
LIKE PRODUCTS 



SORT RESULTING 
LIST BY SCORE 



FILTER OUT PRODUCTS TH.-. 
EXIST IN- BESTSELER LIS 
OF GLOBAL COMMUNITY 
(OPTIONAL) 



RETURN TOP Z ITEMS k 
FROM LIST 
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ACCESS PRODUCT- TO- MEMBERS 
MAPPING M- '.LE (FIG. 5) TO IDENTIFY 
OTHER USERS THAT HAVE 
PURCH/ -ED SAME PRODUCT. 



GET CL-nACT INFORMATION 
FOR EACH LOCATED USER 
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